Who, What, When, Where, Why? Comparing Multiple Approaches to the Cross-Lingual 5W Task

نویسندگان

  • Kristen Parton
  • Kathleen McKeown
  • Bob Coyne
  • Mona T. Diab
  • Ralph Grishman
  • Dilek Z. Hakkani-Tür
  • Mary P. Harper
  • Heng Ji
  • Wei-Yun Ma
  • Adam Meyers
  • Sara Stolbach
  • Ang Sun
  • Gökhan Tür
  • Wei Xu
  • Sibel Yaman
چکیده

Cross-lingual tasks are especially difficult due to the compounding effect of errors in language processing and errors in machine translation (MT). In this paper, we present an error analysis of a new cross-lingual task: the 5W task, a sentence-level understanding task which seeks to return the English 5W's (Who, What, When, Where and Why) corresponding to a Chinese sentence. We analyze systems that we developed, identifying specific problems in language processing and MT that cause errors. The best cross-lingual 5W system was still 19% worse than the best monolingual 5W system, which shows that MT significantly degrades sentence-level understanding. Neither source-language nor targetlanguage analysis was able to circumvent problems in MT, although each approach had advantages relative to the other. A detailed error analysis across multiple systems suggests directions for future research on the problem.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Comparing Multiple Approaches to the Cross-Lingual 5W Task

Cross-lingual tasks are especially difficult due to the compounding effect of errors in language processing and errors in machine translation (MT). In this paper, we present an error analysis of a new cross-lingual task: the 5W task, a sentence-level understanding task which seeks to return the English 5W's (Who, What, When, Where and Why) corresponding to a Chinese sentence. We analyze systems...

متن کامل

The 5W Structure for Sentiment Summarization-Visualization-Tracking

In this paper we address the Sentiment Analysis problem from the end user’s perspective. An end user might desire an automated at-a-glance presentation of the main points made in a single review or how opinion changes time to time over multiple documents. To meet the requirement we propose a relatively generic opinion 5Ws structurization, further used for textual and visual summary and tracking...

متن کامل

Wh-Questions\' Expression in Persian-speaking Children: A Comparison Between Spontaneous and Elicited Probes

Objectives: Studies have shown that most children before the age of 5 are capable to comprehend and express wh-questions in daily conversations. This study aimed at comparing the ability of wh-questions’ production in 4- to 6-year-old children in spontaneous and elicited conditions. Methods: In this descriptive-analytic study, 4- to 6-year-old Persian-speaking children (N = 72) were selected r...

متن کامل

Giveme5W: Main Event Retrieval from News Articles by Extraction of the Five Journalistic W Questions

Extraction of event descriptors from news articles is a commonly required task for various tasks, such as clustering related articles, summarization, and news aggregation. Due to the lack of generally usable and publicly available methods optimized for news, many researchers must redundantly implement such methods for their project. Answers to the five journalistic W questions (5Ws) describe th...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2009